Inductive Entity Typing Alignment
Abstract
Aligning named entity taxonomies for comparing or combining different named entity extraction systems is a difficult task. Often taxonomies are mapped manually onto each other or onto a standardized ontology, but at the loss of subtleties between different class extensions and domain-specific uses of the taxonomy. In this paper, we present an approach and experiments for learning customized taxonomy alignments between different entity extractors for different domains. Our inductive data-driven approach recasts the alignment problem as a classification problem. We present experiments on two named entity recognition benchmark datasets, namely the CoNLL2003 newswire dataset and the MSM2013 microposts dataset. Our results show that the automatically induced mappings outperform manual alignments and are agnostic to changes in the extractor taxonomies, implying that alignments are highly contextual.
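To illustrate what recasting alignment as classification can look like, the sketch below induces a mapping from an extractor's types to a benchmark's types using the simplest possible classifier: a majority vote over observed label pairs. This is not the authors' method, which the abstract does not detail. The function name `induce_alignment`, the extractor type names, and the toy data are all hypothetical; only LOC, ORG and PER are real CoNLL2003 types.

```python
from collections import Counter, defaultdict


def induce_alignment(pairs):
    """Learn a source-type -> target-type mapping from (source, target) label pairs.

    Hypothetical helper: a majority-vote classifier with the extractor's
    type as its only feature, standing in for a richer learned model.
    """
    counts = defaultdict(Counter)
    for source_type, target_type in pairs:
        counts[source_type][target_type] += 1
    # For each extractor type, pick the gold type it co-occurs with most often.
    return {src: tgt.most_common(1)[0][0] for src, tgt in counts.items()}


# Hypothetical toy data: (extractor type, benchmark gold type) per entity mention.
observed = [
    ("City", "LOC"), ("City", "LOC"), ("City", "ORG"),
    ("Company", "ORG"), ("Company", "ORG"),
    ("Politician", "PER"),
]

alignment = induce_alignment(observed)
print(alignment)
```

Because the mapping is induced from the data of a given domain, re-running it on a different corpus can yield a different alignment, which is consistent with the abstract's claim that alignments are contextual.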
Similar resources
A Novel Approach to Unsupervised Grapheme-to-Phoneme Conversion
Automatic, data-driven grapheme-to-phoneme conversion is a challenging but often necessary task. The top-down strategy implicitly adopted by traditional inductive learning techniques tends to dismiss relevant contexts when they have been seen too infrequently in the training data. This paper proposes instead a bottom-up approach which, by design, exhibits better generalization properties. For e...
Molecular typing of toxigenic Clostridium perfringens isolated from sheep in Iran
In this research a molecular method based on polymerase chain reaction for typing of Clostridium perfringens was developed and toxin genotypes of 64 isolates from sheep and goats in Iran were determined. The PCR assays were developed for detection of alpha (cpa), beta (cpb) and epsilon (etx) toxin genes, allowing classification of the isolates into genotypes A, B, C and D. The field isolates ...
Multi-source named entity typing for social media
Typed lexicons that encode knowledge about the semantic types of an entity name, e.g., that ‘Paris’ denotes a geolocation, product, or person, have proven useful for many text processing tasks. While lexicons may be derived from large-scale knowledge bases (KBs), KBs are inherently imperfect, in particular they lack coverage with respect to long tail entity names. We infer the types of a given ...
Polymerase chain reaction typing of Pasteurella multocida capsules isolated in Iran
Capsules from a range of pathogenic bacteria are key determinants of virulence. The capsule has been implicated in the virulence of Pasteurella multocida. In this study a type-specific polymerase chain reaction (PCR) assay was used for capsular typing of 39 avian P. multocida isolates from Iran. The PCR amplified a fragment of 1044 bp from all of the tested isolates. It was found that all avian P. mul...
Structural Recursive Definitions in Type Theory
We introduce an extension of the Calculus of Constructions with inductive and co-inductive types that preserves strong normalisation for a lazy computation relation. This extension considerably enlarges the expressiveness of the language, enabling a direct translation of recursive programs, while keeping a relatively simple collection of typing rules.
Publication date: 2014